Key Expression driven Record Mining for Event Calendar Search
Authors
Abstract
This paper presents an approach for extracting data records from websites, particularly those with event calendars. To this end, we use language-specific key expressions and HTML patterns to recognize each individual event listed on the investigated web page. A notable advantage of our method is that it requires no additional classification step based on machine learning algorithms or keyword extraction methods; it is a so-called one-step mining technique. Our experiments on German opera websites show high precision and recall. Furthermore, we demonstrate that the proposed technique outperforms other data record mining applications when run on event sites.
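The idea of combining key expressions with an HTML pattern can be illustrated with a minimal sketch. It is not the authors' implementation: the German date expressions in `DATE_RE`, the assumption that each record sits in its own `<li>` element, and the names `RecordCollector` and `extract_events` are all hypothetical choices made for this example.

```python
import re
from html.parser import HTMLParser

# Hypothetical German key expressions: dates such as "12. März 2011" or "12.03.2011".
MONTHS = ("Januar|Februar|März|April|Mai|Juni|Juli|August|"
          "September|Oktober|November|Dezember")
DATE_RE = re.compile(rf"\b\d{{1,2}}\.\s*(?:(?:{MONTHS})|\d{{1,2}}\.)\s*\d{{4}}\b")


class RecordCollector(HTMLParser):
    """Collects the text of every outermost element with the given tag.

    The tag stands in for the HTML pattern; treating each <li> as one
    candidate record is an assumption of this sketch.
    """

    def __init__(self, record_tag="li"):
        super().__init__()
        self.record_tag = record_tag
        self.depth = 0        # nesting depth inside record elements
        self.buffer = []      # text fragments of the current record
        self.records = []     # finished candidate records

    def handle_starttag(self, tag, attrs):
        if tag == self.record_tag:
            if self.depth == 0:
                self.buffer = []
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == self.record_tag and self.depth > 0:
            self.depth -= 1
            if self.depth == 0:
                # Join fragments and normalize whitespace.
                self.records.append(" ".join(" ".join(self.buffer).split()))

    def handle_data(self, data):
        if self.depth > 0:
            self.buffer.append(data)


def extract_events(html, record_tag="li"):
    """Return candidate records that contain at least one key expression."""
    collector = RecordCollector(record_tag)
    collector.feed(html)
    collector.close()
    return [r for r in collector.records if DATE_RE.search(r)]


if __name__ == "__main__":
    page = """
    <ul>
      <li><b>Die Zauberflöte</b> 12. März 2011, 19:30 Uhr</li>
      <li>Newsletter abonnieren</li>
      <li>Tosca <span>05.04.2011</span></li>
    </ul>
    """
    for event in extract_events(page):
        print(event)
```

In this sketch a single pass both segments the page and decides which segments are events, which mirrors the one-step character described in the abstract; a real calendar page would need a richer set of key expressions and container patterns.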
Similar resources
Data-Driven Approaches to Improve the Quality of Clinical Processes: A Systematic Review
Background: With the emergence of electronic health records and their related technologies, increasing attention is being paid to data-driven approaches such as machine learning, data mining, and process mining. The aim of this paper was to identify and classify these approaches to enhance the quality of clinical processes. Methods: In order to determine the knowledge related to the research ...
Mining Program Workflow from Interleaved Logs
Successful software maintenance is becoming increasingly critical due to the increasing dependence of our society and economy on software systems. One key problem of software maintenance is the difficulty in understanding the evolving software systems. Program workflows can help system operators and administrators to understand system behaviors and verify system executions so as to greatly faci...
Event-driven and Attribute-driven Robustness
Over five decades have passed since the first wave of robust optimization studies conducted by Soyster and Falk. It is outstanding that real-life applications of robust optimization are still swept aside; there is much more potential for investigating the exact nature of uncertainties to obtain intelligent robust models. For this purpose, in this study, we investigate a more refined description...
Continuous and incremental data mining association rules using frame metadata model
Most organizations have large databases that contain a wealth of potentially accessible information. The unlimited growth of data will inevitably lead to a situation in which it is increasingly difficult to access the desired information. There is a need to extract knowledge from data by Knowledge Discovery in Database. Data mining is the discovery stage of KDD whereas association rule is a pos...
Duration of exclusive breastfeeding; validity of retrospective assessment at nine months of age
BACKGROUND In cross sectional, case control and retrospective cohort studies, duration of Exclusive Breastfeeding (EBF) usually depends on maternal recall. Retrospective data are often subjected to recall bias and could lead to a potential for exposure misclassification. The purpose of the present paper is to assess the validity of maternal recall of EBF duration during infancy, after cessation...